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Abstract: We consider concepts and models for measuring inequality in the 
distribution of resources with a focus on how inequality varies as a function of 
covariates. Lorenz introduced a device for measuring inequality in the distri- 
bution of income that indicates how much the incomes below the u*** quantile 
fall short of the egalitarian situation where everyone has the same income. 
Gini introduced a summary measure of inequality that is the average over u of 
the difference between the Lorenz curve and its values in the egalitarian case. 
More generally, measures of inequality are useful for other response variables 
in addition to income, e.g. wealth, sales, dividends, taxes, market share and 
test scores. In this paper we show that a generalized van Zwet type dispersion 
ordering for distributions of positive random variables induces an ordering on 
the Lorenz curve, the Gini coefficient and other measures of inequality. We 
use this result and distributional orderings based on transformations of distri- 
butions to motivate parametric and semiparametric models whose regression 
coefficients measure effects of covariates on inequality. In particular, we extend 
a parametric Pareto regression model to a flexible semiparametric regression 
model and give partial likelihood estimates of the regression coefficients and 
a baseline distribution that can be used to construct estimates of the various 
conditional measures of inequality. 



1. Introduction 

Measures of inequality provide quantifications of how much the distribution of a 
resource Y deviates from the egalitarian situation where everyone has the same 
amount of the resource. The coefficients in location or location-scale regression 
models are not particularly informative when attention is turned to the influence 
of covariates on inequality. In this paper we consider regression models that are 
not location-scale regression models and whose coefficients are associated with the 
effect of covariates on inequality in the distribution of the response Y. 

We start in Section 2.1 by discussing some familiar and some new measures of 
inequality. Then in Section 2.2 we relate the properties of these measures to a sta- 
tistical ordering of distributions based on transformations of random variables that 
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is equivalent to defining the distribution H of the response Z to have more resource 
inequahty than the distribution F of Y if Z has the same distribution as q(Y)Y 
for some positive nondecreasing function q{-). Then we show that this ordering im- 
phes the corresponding ordering of each measure of inequahty. We also consider 
orderings of distributions based on transformations of distribution functions and 
relate them to inequality. These notions and results assist in the construction of 
regression models with coefficients that relate to the concept of inequality. 

Section 3 shows that scaled power transformation models with the power pa- 
rameter depending on covariates provide regression models where the coefficients 
relate to the concept of resource inequality. Two interesting particular cases are 
the Pareto and the log normal transformation regression models. For these models 
the Lorenz curve for the conditional distribution of Y given covariate values takes 
a particularly simple and intuitive form. We discuss likelihood methods for the 
statistical analysis of these models. 

Finally, in Section 4 we consider scmiparametric Lchmann and Cox typo models 
that are based on power transformations of a baseline distribution Fq, or of 1 — Fqj 
where the power parameter is a function of the covariates. In particular, we consider 
a power transformation model of the form 

(1-1) F{y) = l-{l-Fo{y)r^-\ 

where a{x) is a parametric function depending on a vector jS of regression coeffi- 
cients and an observed vector of covariates x. This is an extension of the Pareto 
regression model to a flexible semiparametric model. For this model we present 
theoretical and empirical formulas for inequality measures and point out that com- 
putations can be based on available software. 



2. Measures of inequality and spread 

2. 1 . Defining curves and measures of inequality 

The Lorenz curve (LC) is defined (Lorenz [19]) to be the proportion of the total 
amount of wealth that is owned by the "poorest" 100 x u percent of the population. 
More precisely, let the random income y > have the distribution function F(ij), 
let F~^{y) = inf{y : F{y) > u) denote the left inverse, and assume that < /U < oo, 
where /U = E{Y). Then the LC (see e.g. Gastwirth [14]) is defined by 

(2.1) L{u) = Lf{u) = 11-^ [ F-'^{s)ds, < u < 1. 

Jo 

Let I^A} denote the indicator of the event A. For F continuous we can write 

(2.2) L{u) = ^,-^E{YI^Y<F-^u)}}• 

When the population consists of incomes of people, the LC measures deviation 
from the egalitarian case L{u) = u corresponding to where everyone has the same 
income a > and the distribution of Y is degenerate at a. The other extreme occurs 
when one person has all the income which corresponds to L{u) = 0, < m < 1. The 
intermediate case where Y is uniform on [0, 6], 6 > 0, corresponds to L{v) = u^. In 
general L{u) is non-decreasing, convex, below the line L(u) = u, < u < 1, and 
the greater the "distance" from u, the greater is the inequality in the population. If 
the population consists of companies providing a certain service or product, the LC 
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measures to what extent a few companies dominate the market with the extreme 
case corresponding to monopoly. 

A closely related curve is the Bonferroni curve {BC) B{u) which is defined 
(Aaberge [1], [2]), Giorgi and Mondani [15], Csorgo, Gastwirth and Zitikis [11]) 
as 



(2.3) 



B{u) = Bf{u) = u-^L{u), < u < 1. 



When F is continuous the BC is the LC except that truncation is replaced by 
conditioning 



(2.4) 



B{u) ^ ^i-^E{Y\Y < F'^{u)}. 



The BC possesses several attractive properties. First, it provides a convenient 
alternative interpretation of the information content of the Lorenz curve. For a 
fixed u, B{u) is the ratio of the mean income of the poorest 100 x u percent of the 
population to the overall mean. Thus, the BC may also yield essential information 
on poverty provided that we know the poverty rate. Second, the BC of a uniform 
(0,a) distribution proves to be the diagonal line joining the points (0,0) and (1,1) 
and thus represents a useful reference line, in addition to the two well-known stan- 
dard reference lines. The egalitarian reference line coincides with the horizontal line 
joining the points (0,1) and (1,1). At the other extreme, when one person holds all 
income, the BC coincides with the horizontal axis except for u= 1. 

In the next subsection we will consider ordering concepts from the statistics 
literature. Those concepts motivate the introduction of the following measures of 
concentration 



(2.5) C{u) = Cf{u) 



and 



(2.6) D{u) = Df{u) = - f 

u Jo 



ri 


F-i(s)i 


L 


F-^{u)\ 


/'■ 




Jo 


lF-^{u)\ 



ds = 



Lf{u) 
F-^{uy 



< u < 1 



ds = fiF 



Bf{u) 
F-^{u)' 



< M < 1. 



Accordingly, D(u) emerges by replacing the overall mean /x in the dominator of 
B(u) by the v}^ quantile y„ = F~'^{u) and is equal to the ratio between the mean 
income of those with lower income than the u*^ quantile and the u-quantile income. 
Thus, C(u) and D{u) measure inequality in income below the quantile. They 
satisfy C{u) < u, D{u) < 1, < u < 1, and C(u) equals u and while D{u) equals 
1 and in the egalitarian and extreme non-egalitarian cases, respectively, and they 
equal u/2 and 1/2 in the uniform case. 

To summarize the information content of the inequality curves we recall the 
following inequality indices 



(2.7) 



G 



Jo 



u — L{u)}du (Gini), B 



Jo 



1 — B{u)}du (Bonferroni), 



(2.8) 



C = 2 [ {u- C{u)}du, D= [ {1 - D{u)}du. 
Jo Jo 



These indices measure distances from the curves to their values in the egalitarian 
case, take values between and 1 and are increasing with increasing inequality. If 
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all units have the same income then G = B = C = D = 0, and in the extreme 
non-egalitarian case where one unit has all the income and the others zero, G = 
B = C = D = 1. When F is uniform on [O^b], B = C = D = 1/2 and G = 1/3. The 
inequality curves L{u), B{u),C{u), D{u), and the inequality measures G,B,C and 
D are scale invariant; that is, they remain the same if Y is replaced by aY, a > 0. 

2.2. Ordering inequality by transforming variables 

When we are interested in how covariates influence inequality we may ask whether 

larger values of a covariatc lead to more or less inequality. For instance, is there 
less inequality among the higher educated? To answer such questions we consider 
orderings of distributions on the basis of inequality, see e.g. Atkinson [5], Shorrocks 
and Foster [26], Dardanoni and Lambert [12], Mulierc and Scarsini [20], Yitzhaki 
and Olkin [29], Zoli [30], and Aaberge [3]. In statistics and reliability engineer- 
ing, orderings are plentiful, e.g. Lehmann [18] , van Zwet [27] , Barlow and Prochan 
[6], Birnbaum, Esary and Marshall [9], Doksum [13], Yanagimoto and Sibuya [28], 
Bickel and Lehmann [7], [8], Rojo and He [21], Rojo [22] and Shaked and Shan- 
thikumar [25]. In statistics, similar orderings are often discussed in terms of spread 
or dispersion. Thus, for non-negative random variables, we could define Y to have 
a distribution which is more spread out to the right than that of Yq if Y can 
be written as Y = h{Yo) for some non-negative, nondecreasing convex function h 
(using van Zwet [27]). It turns out to be more general and more convenient to re- 
place "convex" with "starshaped" (convex functions h are starshaped and concave 
functions g are anti-starshaped provided g{0) = h{0) = 0). 

Recall that a nondecreasing function g defined on the interval / C [0, oo), is 
starshaped on / if g{Xx) < \g{x) whenever x ^ I,Xx ^ I and < A < 1. Thus if 
/ = (0, oo), for any straight line through the origin, then the graph of g initially lies 
on or below it, and then lies on or above it. If g{\x) > Ag(x), g is anti-starshaped. 
On the class JF of continuous and strictly increasing distributions F with F{0) = 0, 
Doksum [13] introduced the following partial ordering F <* (F is starshaped 
with respect to H) if H~^F is starshaped on {x : < F{x) < 1}. This ordering was 
also considered by Yanagimoto and Sibuya [28] and Bickel and Lehmann [7,8]. Note 
that if F <, and X has distribution F, then Z = H~^[F{X)] has distribution H 
and is a starshaped transformation of X. Moreover, 

Proposition 2.1. Suppose that X and Z have distributions F and H, where F, H €: 
J-. Then F H if and only if there exists a positive nondecreasing function q{-) 
on {x -.0 < F{x) < 1} such that Z has the same distribution as q{X)X. 

Proof. Suppose that F H] then q{x) = H^^{F{x))/x will do because the star- 
shaped condition g{Xx) < Xx is equivalent to [g{Xx)/Xx] < g{x)/x. That is, 

(2.9) F <^ H 4^ H~^{F{x))/x is nondecreasing. 

Next, suppose that q{-) is positive and nondecreasing and that Z = q{X)X . Set 
h{x) = q{x)x. Then h{x) is increasing and P(X < x) = P{h{X) < h(x)). It follows 
that F{x) = H{q{x)x). That is, q{x) = H-'^{F{x))/x. □ 

Because of this proposition we say that if F <* if then F is a more egalitarian 
distribution of resources than H. 

We next show that the preceding definition of inequality leads to the correspond- 
ing ordering of the inequality curves Cf{-) and Dp{-) as well as of the indices C 
and D. 
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Proposition 2.2. Suppose that F, H G and F H. Then Cf{u) > Ch{u) and 
Df{u) > Dh{u), < u < 1. Therefore Cp < Ch and Dp < Dh- 

Proof. It follows from 12. 9() . setting u = F{x),v = F{x'), x < x' , that H^^{u)/ 
H^^iv) < F^^{u)/F^^{v), for < u < V < 1. If we integrate this inequality over 
u e {0,v), we obtain Cf{v) > Ch{v), < tj < 1. The other inequalities follow from 
this. □ 

The same order preservation as stated in Proposition 2.2 holds for L{u) and 
Biu). 

Theorem 2.1. Suppose that F,He!F and F H. Then Lp{u) > Lh{u) and 
Bf{u) > Bh{u), < u < 1. Moreover, Bp < Bh and Gp < Gh- 

Proof. Let 

(2.10) a = ^ H-^{v)dvj^ F-^{v)dv, 

and consider the line y = ax through the origin. Then H^^{F(x)) initially lies on 
or below this line, say up to the point x = b. Thus 



H-^{v)dv = i H-\F{x))dF{x)< / axdF{x) 

"'0 "'0 



a / F-'^{v)dv 



(2.11) 



for all w < ^(6) which establishes Lp{u) > Lh{u) for u < F{b). On the other 
hand, for x > b, y — H^^(F{x)) is above y — ax. Thus, for u > F(b) 

.F-\u) 



{u)= [ax- H-^{F{x))\dF{2 

Jb 



is a negative, decreasing function of u. We can write, for u > F{b), 

pu pu pF(b) pF(b) 

a / F-^{v)dv- / H-\v)dv = a F-\v)dv- / H-^{v)dv + s{u) 



Jo Jo Jo Jo 

= c + s{u) 

where c is nonnegative by H2.11|l . It follows that c+s{u) is a decreasing function that 
equals when u = 1 by the definition of a. Thus, c + s{u) > which establishes 
Lp{u) > Lh{u) again by the definition of a. The other inequalities follow from 
this. □ 



2.3. Ordering inequality by transforming distributions 

A partial ordering on T based on transforming distributions rather than random 
variables is the following: F represents more equality than H (F >e H) if 

H{z) - g{F{z)) 

for some nonnegative increasing concave function g on [0, 1] with g(0) — and 
g{l) = 1. In other words, F <e H ii F{H^^) is convex. If F is uniform, the 
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orderings F >g H implies F <^ H and in this case the results of Propositions 2.1, 
2.2 and Theorem 2.1 hold. Note that when F and H have densities / and h, then 
h{z) = g'{F{z))f{z) where g'F is decreasing. That is, F >^ H means that F has 
relatively more probability mass on the right than H. 

A similar ordering involves F{x) = 1 — F{x) and H{z) = 1 — H{z). In this case 
we say that F represents a more equal distribution of resources than H {F >r H) 



for some nonegative increasing convex transformation g on [0, 1] with g{Q) = and 
.9(1) = 1. In this case, if densities exist, they satisfy h{z) = g' (F (z)) f {z) , where 
g'F is decreasing. That is, relative to F, H has mass shifted to the left. 

Remark. Orderings of inequality based on transforming distributions can be re- 
stated in terms of orderings based on transforming random variables. Thus F >f,H 
is equivalent to the distribution function oiV = F{Z) being convex when X F 
and Z r-. H. 

3. Regression inequality models 
3.1. Notation and introduction 

Next consider the case where the distribution of Y depends on covariates such as 
education, work experience, status of parents, sex, etc. Let Xi,. . . ,Xd denote the 
covariates. We include an intercept term in the regression models, which makes it 
convenient to write X= {l,X-i, . . . ,Xd)^ ■ Let F{y\x) denote the conditional dis- 
tribution of Y given X = x and define the quantile regression function as the left 
inverse of this distribution function. The key quantity is 



With this notation we can write the regression versions of the Lorenz curve, for 
< u < 1 as 



Similarly, C{u\x), D{u\x) and the summary coefficients G{x), B{x), C{x) and 

D{x) are defined by replacing F{y) by F{y\x). Note that estimates of F{y\x) and 
fj,{y\x) provide estimates of the regression versions of the curves and measures 
of inequality. Thus, the rest of the paper discusses regression models for F{y\x) 
and /i(t/|cc). Using the results of Section 2, these models are constructed so that 
the regression coefficients reflect relationships between covariates and measures of 
inequality. 

3.2. Transformation regression models 

Let lo with distribution Fq denote a baseline variable which corresponds to the 
case where the covariatc vector x has no effect on the distribution of income. We 
assume that Y has a conditional distribution F{y\x) which depends on x through 
some real valued function A{x)=g{x, l3) which is known up to a vector /3 of un- 
known parameters. Let y Z denote " Y is distributed as Z" . As we have seen in 



if 



H{x) = g{F{x)) 




L{u\x)=ii{u\x)/ ii{l\x), B{u\x)=L{u\x) / u. 
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Section 2.2, if large values of A (a;) correspond to a more egalitarian distribution of 
income than Fq, then it is reasonable to model this as 

Y ^ h{Ya), 

for some increasing anti-starshaped function h depending on A(x). On the other 
hand, an increasing starshaped h would correspond to income being less egalitarian. 
A convenient parametric form of h is 

(3.1) Y ^ tY,"", 

where A = A(a;)> 0, and r > does not depend on x. Since h{y) — y'^(^) is 
concave for < A(a;) < 1, while convex for A{x) > 1, the model (|3.1|l with 

< A(a;) < 1 corresponds to covariates that lead to a less unequal distribution of 
income for Y than for Yq, while A{x)> 1 is the opposite case. Thus it follows from 
the results of Section 2.2 that if we use the parametrization A(a;)= expix^ (3), then 
the coefficient j3j in (3 measures how the covariate relates to inequality in the 
distribution of resources Y. 

Example 3.1. Suppose that Yq ~ Fo where Fq is the Pareto distribution Fo(2/) — 

1 — {c/uY ^ with a>l,c>0,?;>c. Then Y = tY^ has the Pareto distribution 

(3.2) F(y|^)^Fo((^)i) = l-(^)"(^\ y>\ 

T y 

where \ — ct and a{x)— a/ A{x). In this case fi{u\x) and the regression summary 
measures have simple expressions, in particular 

i(u|x) = 1- (l-u)i-'^(^'. 

When A(a;) — exp(a;-^/3) then \ogY already has a scale parameter and we set 
a — 1 without loss of generality. One strategy for estimating /3 is to temporarily 
assume that A is known and to use the maximum likelihood estimate /3(A) based on 
the distribution of log Yi, . . . , logYri. Next, in the case where {Yi,Xi), . . . , (y„, Xn) 
are i.i.d., we can use A — nmm{Yi}/{n + 1) to estimate A. Because A converges to 
A at a faster than ^/n rate, /3(A) is consistent and y/n{(3(\)—(3) is asymptotically 
normal with the covariance matrix being the inverse of the A-known information 
matrix. 

Example 3.2. Another interesting case is obtained by setting Fq equal to the 
log normal distribution $((log(y) — iiq)/(Jq), y > 0. For the scaled log normal 
transformation model we get by straightforward calculation the following explicit 
form for the conditional Lorenz curve: 

(3.3) L{u\x) = (i>{'i>-\u)-<7aA{x)). 

In this case when we choose the parametrization A[x) = exp(3;"^/3), the model 
already includes the scale parameter exp(— /3o) for logF. Thus we set fiQ — 1. To 
estimate /3 for this model we set Zi = logF^. Then Zi has a N(^a + A{xi), aQA'^{xi)) 
distribution, where a = logr and Xi — {l,Xii, . . . ,Xid)'^ ■ Because aa and a are 
unknown there are d + 3 parameters. When Yi, . . . , 1^ are independent, this gives 
the log likelihood function (leaving out the constant term) 

n _^ n 

l{a, /3, CT^) = -n log(CTo) - Y^^^f^ ^ 2^^° cxp{-2xj P){Z, - a - exp{xf (3)}^ 

i=l i=l 
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Likelihood methods will provide estimates, confidence intervals, tests and their 
properties. Software that only require the programming of the likelihood is available, 
e.g. Mathematica 5.2 and Stata 9.0. 

4. Lehmann— Cox type semip£ir£imetric models. Pgirtial likelihood 
4-1- The distribution transformation model 

Let Yb ~ -Fb be a baseline income distribution and let Y ~ F{y\x) denote the 
distribution of income for given covariate vector x. In Section 2.3 it was found that 
one way to express that F{y\x) corresponds to more equality than Fo{y) is to use 
the model 

F{y\x)= h{Fo{y)) 

for some nonnegative increasing concave transformation h depending on x with 
h{0) = and h{l) = 1. Similarly, h convex corresponds to a more egalitarian 
income. A model of the form F2(y) = h{Fi{y)) was considered for the two-sample 
case by Lehmann [17] who noted that F2{y) = F^{y) for A > was a convenient 
choice of h. For regression experiments, we consider a regression version of this 
Lehmann model which we define as 

(4.1) F(y|x) = Fo^(y) 

where A = A(a;) = g{x,(3) is a real valued parametric function and where A < 1 
or A > 1 corresponds to F{y\x) representing a more or less egalitarian distribution 
of resources than Fo{y), respectively. 

To find estimates of (3, note that if we set Ui = 1 — Fo{Yi), then Ui has the 
distribution 

H{u) = 1 - (1 - u)^('^), < M < 1 

which is the distribution of Fo{Yi) in the next subsection. Since the rank i?, of Yi 

equals N + 1 — Si, where St is the rank of 1 — Fo{Yi), wc can use rank methods, or 
Cox partial likelihood methods, to estimate f3 without knowing Fq. In fact, because 
the Cox partial likelihood is a rank likelihood and rank[l — Fo{Yi)]=rank{—Yi) , we 
can apply the likelihood in the next subsection to estimate the parameters in the 
current model provided we reverse the ordering of the Y's. 

4-2. The semiparametric generalized Pareto model 

In this section we show how the Pareto parametric regression model for income can 
be extended to a semiparametric model where the shape of the income distribution 
is completely general. This model coincides with the Cox proportional hazard model 
for which a wealth of theory and methods are available. 

We defined a regression version of the Pareto model in Example 3.1 as 

F{y\x)^l-{^y\ y>c:a,>0, 
where Ui = A^^, Aj = exp{xf(3}. This model satisfies 



(4.2) 



l-F{y\x) = {l-Fo{y)y 
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where Fo{y) = I — c/y,y > c. When Fq is an arbitrary continuous distribution on 
[0, oo), the model (4.2) for the two sample case was called the Lehmann alternative 
by Savage [23], [24] because if V satisfies model (4.1), then Y = —V satisfies model 

(4.2) . Cox [10] introduced proportional hazard models for regression experiments in 
survival analysis which also satisfy (4.2) and introduced partial likelihood methods 
that can be used to analyse such models even in the presence of censoring and time 
dependent covariates (in our case, wage dependent covariates). 

Cox introduced the model equivalent to (4.2) as a generalization of the expo- 
nential model where Fo{y) = 1 — exp(— y) and F{y\xi)=Fo{A~^y). That is, (4.2) 
is in the Cox case a semiparametric generalization of a scale model with scale pa- 
rameter Aj. However, in our case we regard (4.2) as a semiparametric shape model 
which generalizes the Pareto model, and A^ represents the degree of inequality for 
a given covariate vector Xi. The inequality measures correct for this confounding 
of shape and scale by being scale invariant. 

Note from Section 2.3 that A^ < 1 corresponds to F{y\x) more egalitarian than 
Fo{y) while A,; > 1 corresponds to Fq more egalitarian. 

The Cox [10] partial likelihood to estimate f3 for (4.2) is (see also Kalbfleisch 
and Prentice [16], page 102), 

n 

L(/3) = n{ exp(-4)/3) exp(-xffe)/3)} 

where Yj-j) is the i-th order statistic, a;(-j) is the covariate vector for the subject with 
response Y(j), and i?(y(j)) = {A: : Y(fc) > ^(i)}- Here /3=argmaxL(/3) can be found 
in many statistical packages, such as S-Plus, SAS, and STATA 9.0. These packages 
also give the standard errors of the f3j. Note that i(/3) docs not involve _Fo- 

Many estimates are available for Fq in model (4.2) in the same packages. If we 
maximize the likelihood keeping /3 = /3 fixed, we find (e.g., Kalbfleisch and Prentice 
[16], p. 116, Andersen ct al. [4], p. 483) Fo(Y(j)) = 1 — n"=iQ'j) where aj is the 
Breslow-Nelson-Aalen estimate, 

I Efcefl(r„)exp(-a:^)/3)J 

Andersen et al. [4] among others give the asymptotic properties of Fq. 
We can now give theoretical and empirical expressions for the conditional in- 
equality curves and measures. Using (4.2), we find 

(4.3) F-\u\x,)=F,\1-{1-u)^') 



and 



(4.4) ^i{u\xi)= r F-^{t\xi)dt= r Fo\1 - {1 - v)^')dv. 

Jo Jo 



We set t = Fq^{1 - (1 - v)^*) and obtain 

rS{u) 



H{u\xi) 



= A-i / " t{l-Fo{t))^^'-'dFo{t), 
Jo 
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where Si{u) — Fq ^(1 — (1 — w)'^'). To estimate ii{u\xi), we let 

i-i i-i 

b, = Z'oi^) - ^o(>^(^-i)) = n ^ (1 - n 

be the jumps of Fo{-); then 

where the sum is over j with Fo(F(j)) < 1 - (1 - w)'^'. Finally, 
i7(w|a;) — ji{u\x) / jj,{l\x), B{u\x) = L{u\x)/u, 

and 

C{u\x) = fi{u\x)/F-\u\x),I){u\x) ^ C{u\x)/u, 

where F~^{u\x) is the estimate of the conditional quantilc function obtained from 
(I4.3|l by replacing Ai with Ai and Fq with Fq. 

Remark. The methods outlined here for the Cox proportional hazard model have 
been extended to the case of ties among the responses F^, to censored data, and 
to time dependent covariates (see e.g. Cox [10], Andersen et al. [4] and Kalbfleisch 
and Prentice [16]). These extensions can be used in the analysis of the semipara- 
metric generalized Pareto model with tied wages, censored wages, and dependent 
covariates. 
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